Reinforcement theory - PDFSEARCH.IO - Document Search Engine

Reinforcement theory
Results: 290

#	Item
11	Bias in Natural Actor-Critic Algorithms. Philip S. Thomas, Department of Computer Science, University of Massachusetts, Amherst, MA, USA - Source URL: psthomas.com - Language: English - Date: 2012-10-01 18:27:53 - Tags: Statistics, Statistical theory, Estimation theory, Dynamic programming, Markov decision process, Stochastic control, Bias of an estimator, Reinforcement learning, Loss function, Fisher information
12	POVERTY AND SELF-CONTROL. B. Douglas Bernheim, Stanford University and NBER; Debraj Ray, New York University - Source URL: thred.devecon.org - Language: English - Date: 2014-06-28 16:47:12 - Tags: Game theory, Behavior, Decision theory, Psychology, Nash equilibrium, Behavioral economics, Norm, Reinforcement, Dynamic inconsistency, Trembling hand perfect equilibrium, Subgame perfect equilibrium, Economic equilibrium
13	Verification of Markov Decision Processes using Learning Algorithms. Tomáš Brázdil1, Krishnendu Chatterjee2, Martin Chmelík2, Vojtěch Forejt3, Jan Křetínský2, Marta Kwiatkowska3, David Parker4, a - Source URL: www.hieratic.eu - Language: English - Tags: Computational complexity theory, Theory of computation, Dynamic programming, Markov decision process, Stochastic control, Analysis of algorithms, Mathematical logic, Reinforcement learning, Time complexity, Algorithm, PP
14	PREDICTING WHEN TO LAUGH WITH STRUCTURED CLASSIFICATION. Bilal Piot1, Olivier Pietquin2, Matthieu Geist1. 1 SUPELEC IMS-MaLIS research group and UMIGeorgiaTech - CNRS) 2 - Source URL: www.metz.supelec.fr - Language: English - Date: 2014-07-15 03:12:51 - Tags: Theoretical computer science, Machine learning, Learning, Statistical classification, Support vector machine, Laughter, K-nearest neighbors algorithm, Algorithm, Computational learning theory, Artificial neural network, NP, Reinforcement learning
15	Bandits all the way down: UCB1 as a simulation policy in Monte Carlo Tree Search. Edward J. Powley, Daniel Whitehouse, and Peter I. Cowling. Department of Computer Science, York Centre for Complex Systems Analysis, Universit - Source URL: eldar.mathstat.uoguelph.ca - Language: English - Date: 2016-07-12 12:05:04 - Tags: Monte Carlo methods, Combinatorial game theory, Monte Carlo tree search, Statistical mechanics, General game playing, Reinforcement learning, Simulation, Thomas Nast, Artificial intelligence
16	Coordination in Multiagent Reinforcement Learning: A Bayesian Approach. Georgios Chalkiadakis, Craig Boutilier - Source URL: www.intelligence.tuc.gr - Language: English - Date: 2009-03-02 16:24:03 - Tags: Game theory, Reinforcement learning, Nash equilibrium, Q-learning, Strategy, Partially observable Markov decision process, Action selection, Best response, Bellman equation, Zero-sum game, Agent-based model, Solution concept
17	RAAM: The Benefits of Robustness in Approximating Aggregated MDPs in Reinforcement Learning. Dharmashankar Subramanian, IBM T. J. Watson Research Center, Yorktown Heights, NY 10598 - Source URL: marek.petrik.us - Language: English - Date: 2016-07-14 09:59:52 - Tags: Algebraic geometry, Field theory, Valuation, Reinforcement learning, Differential topology
18	A Survey of Multi-Objective Sequential Decision-Making. Diederik M. Roijers. Journal of Artificial Intelligence Research (submitted 3/13) - Source URL: arxiv.org - Language: English - Date: 2014-02-04 20:03:22 - Tags: Operations research, Dynamic programming, Mathematical optimization, Equations, Decision theory, Reinforcement learning, Markov decision process, Bellman equation, Policy, Partially observable Markov decision process
19	Inverse Reinforcement Learning for Interactive Systems [Extended Abstract]. Olivier Pietquin, SUPELEC - UMIGeorgiaTech-CNRS), 2 rue Edouard Belin, Metz - France - Source URL: www.ilhaire.eu - Language: English - Date: 2013-10-03 05:33:46 - Tags: Machine learning, Computational linguistics, User interface techniques, Multimodal interaction, User interfaces, Reinforcement learning, Apprenticeship learning, Computational learning theory, Speech recognition, Intelligent agent, Dialog system, Dialog manager
20	On Stochastic Feedback Control for Multi-antenna Beamforming: Formulation and Low-Complexity Algorithms. Sun Sun, Min Dong, and Ben Liang - Source URL: www.comm.utoronto.ca - Language: English - Date: 2014-05-05 14:44:36 - Tags: Markov processes, Markov models, Mathematical optimization, Stochastic control, Dynamic programming, Markov decision process, Beamforming, Reinforcement learning, Optimal control, Markov chain, Q-learning, Control theory